Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 25000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.3 MiB |
| Average record size in memory | 96.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 5 |
Alerts
| change is highly overall correlated with diabetes_med | High correlation |
|---|---|
| diabetes_med is highly overall correlated with change | High correlation |
| glucose_test is highly imbalanced (77.1%) | Imbalance |
| A1Ctest is highly imbalanced (50.5%) | Imbalance |
| n_emergency is highly skewed (γ1 = 24.53015169) | Skewed |
| age has 2532 (10.1%) zeros | Zeros |
| n_procedures has 11409 (45.6%) zeros | Zeros |
| n_outpatient has 20859 (83.4%) zeros | Zeros |
| n_inpatient has 16537 (66.1%) zeros | Zeros |
| n_emergency has 22272 (89.1%) zeros | Zeros |
| medical_specialty has 1409 (5.6%) zeros | Zeros |
| diag_1 has 7824 (31.3%) zeros | Zeros |
| diag_2 has 8134 (32.5%) zeros | Zeros |
| diag_3 has 7686 (30.7%) zeros | Zeros |
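
The imbalance percentages in the alerts above are consistent with an entropy-based score: one minus the Shannon entropy of the class shares, divided by the maximum entropy for that number of classes. The sketch below is a reconstruction under that assumption, not a copy of the ydata-profiling source. The function name `imbalance_score` is hypothetical, and the class counts are taken from the Common Values tables for glucose_test and A1Ctest later in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    Assumption: this mirrors the score behind the "Imbalance" alerts.
    """
    total = sum(counts)
    if len(counts) < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(len(counts))

# Class counts from the glucose_test and A1Ctest value tables (n = 25000).
glucose_score = imbalance_score([23625, 689, 686])
a1c_score = imbalance_score([20938, 2827, 1235])
print(f"glucose_test: {glucose_score:.1%}")
print(f"A1Ctest: {a1c_score:.1%}")
```

Under this definition a binary column split 54/46, such as change, scores close to zero, which would explain why it raises no imbalance alert.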
Reproduction
| Analysis started | 2024-08-28 15:57:40.630700 |
|---|---|
| Analysis finished | 2024-08-28 15:58:29.325423 |
| Duration | 48.69 seconds |
| Software version | ydata-profiling v4.9.0 |
| Download configuration | config.json |
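
The Reproduction table records the ydata-profiling version but not the call that produced the report. A report of this kind is typically generated along the lines sketched below. The file names `readmissions.csv` and `report.html`, the report title, and the helper name `build_report` are hypothetical, since the source does not name the input file.

```python
def build_report(csv_path, html_path, title="Profiling report"):
    """Profile a CSV file and write the HTML report to html_path.

    The imports are lazy so that this function can be defined even where
    pandas or ydata-profiling is not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title=title)
    profile.to_file(html_path)
    return profile

# Example call (hypothetical file names):
# build_report("readmissions.csv", "report.html")
```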
age
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.34412 |
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 2532 |
| Zeros (%) | 10.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 97.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.3156332 |
|---|---|
| Coefficient of variation (CV) | 0.56124822 |
| Kurtosis | -0.81550571 |
| Mean | 2.34412 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.12617834 |
| Sum | 58603 |
| Variance | 1.7308907 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 6837 | 27.3% |
| 2 | 5913 | 23.7% |
| 4 | 4516 | 18.1% |
| 1 | 4452 | 17.8% |
| 0 | 2532 | 10.1% |
| 5 | 750 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 2532 | 10.1% |
| 1 | 4452 | 17.8% |
| 2 | 5913 | 23.7% |
| 3 | 6837 | 27.3% |
| 4 | 4516 | 18.1% |
| 5 | 750 | 3.0% |
| Value | Count | Frequency (%) |
| 5 | 750 | 3.0% |
| 4 | 4516 | 18.1% |
| 3 | 6837 | 27.3% |
| 2 | 5913 | 23.7% |
| 1 | 4452 | 17.8% |
| 0 | 2532 | 10.1% |
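
The descriptive statistics for age can be re-derived from its value counts alone. The sketch below assumes the Variance row is the sample variance (divisor n - 1), which is what the reported figures are consistent with. The helper name `summarize` is illustrative and not part of the profiling tool.

```python
import math

# Value counts for `age`, copied from the frequency table above.
age_counts = {0: 2532, 1: 4452, 2: 5913, 3: 6837, 4: 4516, 5: 750}

def summarize(counts):
    """Compute n, sum, mean, sample variance, std and CV from value counts."""
    n = sum(counts.values())
    total = sum(value * count for value, count in counts.items())
    mean = total / n
    # Sample variance: squared deviations divided by n - 1.
    squared_dev = sum(count * (value - mean) ** 2 for value, count in counts.items())
    variance = squared_dev / (n - 1)
    std = math.sqrt(variance)
    # Coefficient of variation: standard deviation relative to the mean.
    return {"n": n, "sum": total, "mean": mean,
            "variance": variance, "std": std, "cv": std / mean}

age_stats = summarize(age_counts)
for name, value in age_stats.items():
    print(f"{name}: {value:.8g}")
```

The same helper applies to any variable in this report whose frequency table lists every distinct value.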
time_in_hospital
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.45332 |
| Minimum | 1 |
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.0014699 |
|---|---|
| Coefficient of variation (CV) | 0.67398477 |
| Kurtosis | 0.800598 |
| Mean | 4.45332 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.1089046 |
| Sum | 111333 |
| Variance | 9.0088213 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 4311 | 17.2% |
| 2 | 3986 | 15.9% |
| 1 | 3480 | 13.9% |
| 4 | 3467 | 13.9% |
| 5 | 2542 | 10.2% |
| 6 | 1895 | 7.6% |
| 7 | 1467 | 5.9% |
| 8 | 1104 | 4.4% |
| 9 | 768 | 3.1% |
| 10 | 588 | 2.4% |
| Other values (4) | 1392 | 5.6% |
| Value | Count | Frequency (%) |
| 1 | 3480 | 13.9% |
| 2 | 3986 | 15.9% |
| 3 | 4311 | 17.2% |
| 4 | 3467 | 13.9% |
| 5 | 2542 | 10.2% |
| 6 | 1895 | 7.6% |
| 7 | 1467 | 5.9% |
| 8 | 1104 | 4.4% |
| 9 | 768 | 3.1% |
| 10 | 588 | 2.4% |
| Value | Count | Frequency (%) |
| 14 | 281 | 1.1% |
| 13 | 299 | 1.2% |
| 12 | 354 | 1.4% |
| 11 | 458 | 1.8% |
| 10 | 588 | 2.4% |
| 9 | 768 | 3.1% |
| 8 | 1104 | 4.4% |
| 7 | 1467 | 5.9% |
| 6 | 1895 | 7.6% |
| 5 | 2542 | 10.2% |
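
The Quantile statistics for time_in_hospital can be checked against the frequency tables above, which together list all 14 distinct values. The sketch below assumes quantiles are taken by linear interpolation between neighbouring order statistics (the pandas default). The names `quantile` and `value_at` are illustrative.

```python
from bisect import bisect_right
from itertools import accumulate

# Counts for time_in_hospital; values 11-14 come from the last table above.
stay_counts = {1: 3480, 2: 3986, 3: 4311, 4: 3467, 5: 2542, 6: 1895, 7: 1467,
               8: 1104, 9: 768, 10: 588, 11: 458, 12: 354, 13: 299, 14: 281}

def value_at(sorted_values, cumulative, index):
    """Return the value at a 0-based position of the sorted sample."""
    return sorted_values[bisect_right(cumulative, index)]

def quantile(counts, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    values = sorted(counts)
    cumulative = list(accumulate(counts[v] for v in values))
    n = cumulative[-1]
    position = q * (n - 1)
    lower = int(position)
    upper = min(lower + 1, n - 1)
    fraction = position - lower
    low_value = value_at(values, cumulative, lower)
    high_value = value_at(values, cumulative, upper)
    return low_value + fraction * (high_value - low_value)

q1, median, q3 = (quantile(stay_counts, q) for q in (0.25, 0.5, 0.75))
print("Q1:", q1, "median:", median, "Q3:", q3, "IQR:", q3 - q1)
```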
n_lab_procedures
Real number (ℝ)
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.24076 |
| Minimum | 1 |
| Maximum | 113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 31 |
| median | 44 |
| Q3 | 57 |
| 95-th percentile | 74 |
| Maximum | 113 |
| Range | 112 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.81862 |
|---|---|
| Coefficient of variation (CV) | 0.45833191 |
| Kurtosis | -0.29792189 |
| Mean | 43.24076 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.23867244 |
| Sum | 1081019 |
| Variance | 392.77771 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 751 | 3.0% |
| 43 | 638 | 2.6% |
| 44 | 589 | 2.4% |
| 45 | 572 | 2.3% |
| 38 | 547 | 2.2% |
| 39 | 538 | 2.2% |
| 41 | 535 | 2.1% |
| 49 | 521 | 2.1% |
| 46 | 515 | 2.1% |
| 40 | 513 | 2.1% |
| Other values (99) | 19281 | 77.1% |
| Value | Count | Frequency (%) |
| 1 | 751 | 3.0% |
| 2 | 268 | 1.1% |
| 3 | 173 | 0.7% |
| 4 | 102 | 0.4% |
| 5 | 79 | 0.3% |
| 6 | 68 | 0.3% |
| 7 | 80 | 0.3% |
| 8 | 90 | 0.4% |
| 9 | 251 | 1.0% |
| 10 | 219 | 0.9% |
| Value | Count | Frequency (%) |
| 113 | 1 | < 0.1% |
| 111 | 1 | < 0.1% |
| 109 | 1 | < 0.1% |
| 108 | 2 | < 0.1% |
| 106 | 2 | < 0.1% |
| 105 | 2 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 101 | 5 | < 0.1% |
| 100 | 2 | < 0.1% |
n_procedures
Real number (ℝ)
ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.35236 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 11409 |
| Zeros (%) | 45.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7151793 |
|---|---|
| Coefficient of variation (CV) | 1.268286 |
| Kurtosis | 0.79556662 |
| Mean | 1.35236 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3005718 |
| Sum | 33809 |
| Variance | 2.9418401 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 11409 | 45.6% |
| 1 | 5098 | 20.4% |
| 2 | 3064 | 12.3% |
| 3 | 2395 | 9.6% |
| 6 | 1227 | 4.9% |
| 4 | 999 | 4.0% |
| 5 | 808 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 11409 | 45.6% |
| 1 | 5098 | 20.4% |
| 2 | 3064 | 12.3% |
| 3 | 2395 | 9.6% |
| 4 | 999 | 4.0% |
| 5 | 808 | 3.2% |
| 6 | 1227 | 4.9% |
| Value | Count | Frequency (%) |
| 6 | 1227 | 4.9% |
| 5 | 808 | 3.2% |
| 4 | 999 | 4.0% |
| 3 | 2395 | 9.6% |
| 2 | 3064 | 12.3% |
| 1 | 5098 | 20.4% |
| 0 | 11409 | 45.6% |
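
The Skewness row for n_procedures is consistent with the bias-adjusted sample skewness (the adjusted Fisher-Pearson coefficient, which is also what pandas computes by default). The sketch below recomputes it from the value counts under that assumption. The function name `sample_skewness` is illustrative.

```python
import math

# Value counts for `n_procedures`, copied from the frequency table above.
procedure_counts = {0: 11409, 1: 5098, 2: 3064, 3: 2395, 4: 999, 5: 808, 6: 1227}

def sample_skewness(counts):
    """Adjusted Fisher-Pearson skewness G1 computed from value counts."""
    n = sum(counts.values())
    mean = sum(v * c for v, c in counts.items()) / n
    # Second and third central moments (population form, divisor n).
    m2 = sum(c * (v - mean) ** 2 for v, c in counts.items()) / n
    m3 = sum(c * (v - mean) ** 3 for v, c in counts.items()) / n
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor; close to 1 for n = 25000.
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)

print(f"skewness: {sample_skewness(procedure_counts):.7f}")
```

A positive value reflects the long right tail: almost half of the records have zero procedures, while a small share reaches five or six.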
n_medications
Real number (ℝ)
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.2524 |
| Minimum | 1 |
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 11 |
| median | 15 |
| Q3 | 20 |
| 95-th percentile | 31 |
| Maximum | 79 |
| Range | 78 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 8.0605318 |
|---|---|
| Coefficient of variation (CV) | 0.49595948 |
| Kurtosis | 3.4765952 |
| Mean | 16.2524 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.316139 |
| Sum | 406310 |
| Variance | 64.972173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 1509 | 6.0% |
| 15 | 1469 | 5.9% |
| 13 | 1459 | 5.8% |
| 11 | 1396 | 5.6% |
| 14 | 1396 | 5.6% |
| 16 | 1379 | 5.5% |
| 17 | 1271 | 5.1% |
| 10 | 1268 | 5.1% |
| 9 | 1157 | 4.6% |
| 18 | 1140 | 4.6% |
| Other values (60) | 11556 | 46.2% |
| Value | Count | Frequency (%) |
| 1 | 66 | 0.3% |
| 2 | 83 | 0.3% |
| 3 | 198 | 0.8% |
| 4 | 275 | 1.1% |
| 5 | 419 | 1.7% |
| 6 | 631 | 2.5% |
| 7 | 828 | 3.3% |
| 8 | 1052 | 4.2% |
| 9 | 1157 | 4.6% |
| 10 | 1268 | 5.1% |
| Value | Count | Frequency (%) |
| 79 | 1 | < 0.1% |
| 75 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 69 | 2 | < 0.1% |
| 68 | 2 | < 0.1% |
| 65 | 2 | < 0.1% |
| 64 | 1 | < 0.1% |
| 63 | 5 | < 0.1% |
| 62 | 3 | < 0.1% |
| 61 | 4 | < 0.1% |
n_outpatient
Real number (ℝ)
ZEROS 
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3664 |
| Minimum | 0 |
| Maximum | 33 |
| Zeros | 20859 |
| Zeros (%) | 83.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1954782 |
|---|---|
| Coefficient of variation (CV) | 3.2627681 |
| Kurtosis | 95.925322 |
| Mean | 0.3664 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.3026053 |
| Sum | 9160 |
| Variance | 1.4291682 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20859 | 83.4% |
| 1 | 2076 | 8.3% |
| 2 | 913 | 3.7% |
| 3 | 537 | 2.1% |
| 4 | 269 | 1.1% |
| 5 | 136 | 0.5% |
| 6 | 74 | 0.3% |
| 7 | 39 | 0.2% |
| 8 | 18 | 0.1% |
| 11 | 16 | 0.1% |
| Other values (13) | 63 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 20859 | 83.4% |
| 1 | 2076 | 8.3% |
| 2 | 913 | 3.7% |
| 3 | 537 | 2.1% |
| 4 | 269 | 1.1% |
| 5 | 136 | 0.5% |
| 6 | 74 | 0.3% |
| 7 | 39 | 0.2% |
| 8 | 18 | 0.1% |
| 9 | 13 | 0.1% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 23 | 1 | < 0.1% |
| 21 | 3 | < 0.1% |
| 20 | 2 | < 0.1% |
| 18 | 2 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 5 | < 0.1% |
| 14 | 7 | < 0.1% |
| 13 | 7 | < 0.1% |
n_inpatient
Real number (ℝ)
ZEROS 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.61596 |
| Minimum | 0 |
| Maximum | 15 |
| Zeros | 16537 |
| Zeros (%) | 66.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1779511 |
|---|---|
| Coefficient of variation (CV) | 1.9123825 |
| Kurtosis | 16.454233 |
| Mean | 0.61596 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.2546341 |
| Sum | 15399 |
| Variance | 1.3875688 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16537 | 66.1% |
| 1 | 4926 | 19.7% |
| 2 | 1909 | 7.6% |
| 3 | 833 | 3.3% |
| 4 | 358 | 1.4% |
| 5 | 211 | 0.8% |
| 6 | 104 | 0.4% |
| 7 | 47 | 0.2% |
| 8 | 26 | 0.1% |
| 9 | 20 | 0.1% |
| Other values (6) | 29 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 16537 | 66.1% |
| 1 | 4926 | 19.7% |
| 2 | 1909 | 7.6% |
| 3 | 833 | 3.3% |
| 4 | 358 | 1.4% |
| 5 | 211 | 0.8% |
| 6 | 104 | 0.4% |
| 7 | 47 | 0.2% |
| 8 | 26 | 0.1% |
| 9 | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 3 | < 0.1% |
| 11 | 8 | < 0.1% |
| 10 | 12 | < 0.1% |
| 9 | 20 | 0.1% |
| 8 | 26 | 0.1% |
| 7 | 47 | 0.2% |
| 6 | 104 | 0.4% |
n_emergency
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1866 |
| Minimum | 0 |
| Maximum | 64 |
| Zeros | 22272 |
| Zeros (%) | 89.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 195.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 64 |
| Range | 64 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.88587348 |
|---|---|
| Coefficient of variation (CV) | 4.7474463 |
| Kurtosis | 1310.5933 |
| Mean | 0.1866 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.530152 |
| Sum | 4665 |
| Variance | 0.78477183 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 22272 | 89.1% |
| 1 | 1842 | 7.4% |
| 2 | 525 | 2.1% |
| 3 | 167 | 0.7% |
| 4 | 83 | 0.3% |
| 5 | 40 | 0.2% |
| 7 | 18 | 0.1% |
| 6 | 18 | 0.1% |
| 9 | 6 | < 0.1% |
| 8 | 6 | < 0.1% |
| Other values (11) | 23 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 22272 | 89.1% |
| 1 | 1842 | 7.4% |
| 2 | 525 | 2.1% |
| 3 | 167 | 0.7% |
| 4 | 83 | 0.3% |
| 5 | 40 | 0.2% |
| 6 | 18 | 0.1% |
| 7 | 18 | 0.1% |
| 8 | 6 | < 0.1% |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 64 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 3 | < 0.1% |
| 16 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 3 | < 0.1% |
medical_specialty
Real number (ℝ)
ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4588 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 1409 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 97.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.4254486 |
|---|---|
| Coefficient of variation (CV) | 0.41212231 |
| Kurtosis | 0.3519299 |
| Mean | 3.4588 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.84100624 |
| Sum | 86470 |
| Variance | 2.0319038 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 12382 | 49.5% |
| 3 | 3565 | 14.3% |
| 5 | 2664 | 10.7% |
| 1 | 1885 | 7.5% |
| 2 | 1882 | 7.5% |
| 0 | 1409 | 5.6% |
| 6 | 1213 | 4.9% |
| Value | Count | Frequency (%) |
| 0 | 1409 | 5.6% |
| 1 | 1885 | 7.5% |
| 2 | 1882 | 7.5% |
| 3 | 3565 | 14.3% |
| 4 | 12382 | 49.5% |
| 5 | 2664 | 10.7% |
| 6 | 1213 | 4.9% |
| Value | Count | Frequency (%) |
| 6 | 1213 | 4.9% |
| 5 | 2664 | 10.7% |
| 4 | 12382 | 49.5% |
| 3 | 3565 | 14.3% |
| 2 | 1882 | 7.5% |
| 1 | 1885 | 7.5% |
| 0 | 1409 | 5.6% |
diag_1
Real number (ℝ)
ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.29708 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 7824 |
| Zeros (%) | 31.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 97.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 2.8277806 |
|---|---|
| Coefficient of variation (CV) | 0.85766212 |
| Kurtosis | -1.7306832 |
| Mean | 3.29708 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.02586226 |
| Sum | 82427 |
| Variance | 7.9963433 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7824 | 31.3% |
| 6 | 6498 | 26.0% |
| 7 | 3680 | 14.7% |
| 2 | 2329 | 9.3% |
| 1 | 1747 | 7.0% |
| 3 | 1666 | 6.7% |
| 5 | 1252 | 5.0% |
| 4 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7824 | 31.3% |
| 1 | 1747 | 7.0% |
| 2 | 2329 | 9.3% |
| 3 | 1666 | 6.7% |
| 4 | 4 | < 0.1% |
| 5 | 1252 | 5.0% |
| 6 | 6498 | 26.0% |
| 7 | 3680 | 14.7% |
| Value | Count | Frequency (%) |
| 7 | 3680 | 14.7% |
| 6 | 6498 | 26.0% |
| 5 | 1252 | 5.0% |
| 4 | 4 | < 0.1% |
| 3 | 1666 | 6.7% |
| 2 | 2329 | 9.3% |
| 1 | 1747 | 7.0% |
| 0 | 7824 | 31.3% |
diag_2
Real number (ℝ)
ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.33452 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 8134 |
| Zeros (%) | 32.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 97.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 2.9135264 |
|---|---|
| Coefficient of variation (CV) | 0.87374686 |
| Kurtosis | -1.8487802 |
| Mean | 3.33452 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.042003843 |
| Sum | 83363 |
| Variance | 8.4886359 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 9056 | 36.2% |
| 0 | 8134 | 32.5% |
| 1 | 2906 | 11.6% |
| 7 | 2872 | 11.5% |
| 2 | 973 | 3.9% |
| 3 | 591 | 2.4% |
| 5 | 426 | 1.7% |
| 4 | 42 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 8134 | 32.5% |
| 1 | 2906 | 11.6% |
| 2 | 973 | 3.9% |
| 3 | 591 | 2.4% |
| 4 | 42 | 0.2% |
| 5 | 426 | 1.7% |
| 6 | 9056 | 36.2% |
| 7 | 2872 | 11.5% |
| Value | Count | Frequency (%) |
| 7 | 2872 | 11.5% |
| 6 | 9056 | 36.2% |
| 5 | 426 | 1.7% |
| 4 | 42 | 0.2% |
| 3 | 591 | 2.4% |
| 2 | 973 | 3.9% |
| 1 | 2906 | 11.6% |
| 0 | 8134 | 32.5% |
diag_3
Real number (ℝ)
ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.14364 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 7686 |
| Zeros (%) | 30.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 97.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 2.8372186 |
|---|---|
| Coefficient of variation (CV) | 0.90252657 |
| Kurtosis | -1.8411175 |
| Mean | 3.14364 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.075262148 |
| Sum | 78591 |
| Variance | 8.0498095 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 9107 | 36.4% |
| 0 | 7686 | 30.7% |
| 1 | 4261 | 17.0% |
| 7 | 1915 | 7.7% |
| 2 | 916 | 3.7% |
| 3 | 464 | 1.9% |
| 5 | 455 | 1.8% |
| 4 | 196 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 7686 | 30.7% |
| 1 | 4261 | 17.0% |
| 2 | 916 | 3.7% |
| 3 | 464 | 1.9% |
| 4 | 196 | 0.8% |
| 5 | 455 | 1.8% |
| 6 | 9107 | 36.4% |
| 7 | 1915 | 7.7% |
| Value | Count | Frequency (%) |
| 7 | 1915 | 7.7% |
| 6 | 9107 | 36.4% |
| 5 | 455 | 1.8% |
| 4 | 196 | 0.8% |
| 3 | 464 | 1.9% |
| 2 | 916 | 3.7% |
| 1 | 4261 | 17.0% |
| 0 | 7686 | 30.7% |
glucose_test
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 1 | 23625 |
|---|---|
| 2 | 689 |
| 0 | 686 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 23625 | 94.5% |
| 2 | 689 | 2.8% |
| 0 | 686 | 2.7% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 23625 | 94.5% |
| 2 | 689 | 2.8% |
| 0 | 686 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 23625 | 94.5% |
| 2 | 689 | 2.8% |
| 0 | 686 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 23625 | 94.5% |
| 2 | 689 | 2.8% |
| 0 | 686 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 23625 | 94.5% |
| 2 | 689 | 2.8% |
| 0 | 686 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 23625 | 94.5% |
| 2 | 689 | 2.8% |
| 0 | 686 | 2.7% |
A1Ctest
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 1 | 20938 |
|---|---|
| 0 | 2827 |
| 2 | 1235 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 20938 | 83.8% |
| 0 | 2827 | 11.3% |
| 2 | 1235 | 4.9% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 20938 | 83.8% |
| 0 | 2827 | 11.3% |
| 2 | 1235 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 20938 | 83.8% |
| 0 | 2827 | 11.3% |
| 2 | 1235 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 20938 | 83.8% |
| 0 | 2827 | 11.3% |
| 2 | 1235 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 20938 | 83.8% |
| 0 | 2827 | 11.3% |
| 2 | 1235 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 20938 | 83.8% |
| 0 | 2827 | 11.3% |
| 2 | 1235 | 4.9% |
change
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 0 | 13497 |
|---|---|
| 1 | 11503 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 13497 | 54.0% |
| 1 | 11503 | 46.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 13497 | 54.0% |
| 1 | 11503 | 46.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13497 | 54.0% |
| 1 | 11503 | 46.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 13497 | 54.0% |
| 1 | 11503 | 46.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 13497 | 54.0% |
| 1 | 11503 | 46.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 13497 | 54.0% |
| 1 | 11503 | 46.0% |
diabetes_med
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 1 | 19228 |
|---|---|
| 0 | 5772 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 19228 | 76.9% |
| 0 | 5772 | 23.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 19228 | 76.9% |
| 0 | 5772 | 23.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 19228 | 76.9% |
| 0 | 5772 | 23.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 19228 | 76.9% |
| 0 | 5772 | 23.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 19228 | 76.9% |
| 0 | 5772 | 23.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 19228 | 76.9% |
| 0 | 5772 | 23.1% |
readmitted
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 0 | 13246 |
|---|---|
| 1 | 11754 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 25000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 13246 | 53.0% |
| 1 | 11754 | 47.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 13246 | 53.0% |
| 1 | 11754 | 47.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13246 | 53.0% |
| 1 | 11754 | 47.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 13246 | 53.0% |
| 1 | 11754 | 47.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 13246 | 53.0% |
| 1 | 11754 | 47.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 13246 | 53.0% |
| 1 | 11754 | 47.0% |
Correlations
| | A1Ctest | age | change | diabetes_med | diag_1 | diag_2 | diag_3 | glucose_test | medical_specialty | n_emergency | n_inpatient | n_lab_procedures | n_medications | n_outpatient | n_procedures | readmitted | time_in_hospital |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A1Ctest | 1.000 | 0.070 | 0.108 | 0.081 | 0.085 | 0.043 | 0.015 | 0.052 | 0.062 | 0.000 | 0.034 | 0.188 | 0.024 | 0.016 | 0.042 | 0.023 | 0.053 |
| age | 0.070 | 1.000 | 0.055 | 0.035 | -0.024 | -0.022 | -0.031 | 0.033 | -0.044 | -0.038 | 0.015 | 0.035 | -0.035 | 0.020 | -0.099 | 0.042 | 0.101 |
| change | 0.108 | 0.055 | 1.000 | 0.506 | 0.110 | 0.063 | 0.035 | 0.058 | 0.084 | 0.000 | 0.019 | 0.066 | 0.242 | 0.015 | 0.019 | 0.043 | 0.114 |
| diabetes_med | 0.081 | 0.035 | 0.506 | 1.000 | 0.088 | 0.054 | 0.031 | 0.052 | 0.034 | 0.013 | 0.031 | 0.039 | 0.207 | 0.000 | 0.021 | 0.062 | 0.072 |
| diag_1 | 0.085 | -0.024 | 0.110 | 0.088 | 1.000 | 0.190 | 0.095 | 0.040 | 0.069 | 0.036 | 0.023 | -0.007 | -0.021 | 0.020 | -0.218 | 0.056 | 0.026 |
| diag_2 | 0.043 | -0.022 | 0.063 | 0.054 | 0.190 | 1.000 | 0.088 | 0.033 | 0.044 | 0.056 | 0.032 | 0.057 | 0.027 | 0.024 | -0.100 | 0.032 | 0.091 |
| diag_3 | 0.015 | -0.031 | 0.035 | 0.031 | 0.095 | 0.088 | 1.000 | 0.020 | 0.022 | 0.032 | 0.030 | 0.067 | 0.016 | 0.016 | -0.044 | 0.039 | 0.077 |
| glucose_test | 0.052 | 0.033 | 0.058 | 0.052 | 0.040 | 0.033 | 0.020 | 1.000 | 0.082 | 0.000 | 0.021 | 0.272 | 0.030 | 0.037 | 0.054 | 0.015 | 0.034 |
| medical_specialty | 0.062 | -0.044 | 0.084 | 0.034 | 0.069 | 0.044 | 0.022 | 0.082 | 1.000 | -0.028 | 0.001 | -0.113 | 0.106 | 0.070 | 0.075 | 0.056 | 0.028 |
| n_emergency | 0.000 | -0.038 | 0.000 | 0.013 | 0.036 | 0.056 | 0.032 | 0.000 | -0.028 | 1.000 | 0.220 | 0.009 | 0.045 | 0.176 | -0.051 | 0.037 | -0.004 |
| n_inpatient | 0.034 | 0.015 | 0.019 | 0.031 | 0.023 | 0.032 | 0.030 | 0.021 | 0.001 | 0.220 | 1.000 | 0.044 | 0.101 | 0.168 | -0.068 | 0.191 | 0.094 |
| n_lab_procedures | 0.188 | 0.035 | 0.066 | 0.039 | -0.007 | 0.057 | 0.067 | 0.272 | -0.113 | 0.009 | 0.044 | 1.000 | 0.259 | -0.031 | 0.015 | 0.045 | 0.350 |
| n_medications | 0.024 | -0.035 | 0.242 | 0.207 | -0.021 | 0.027 | 0.016 | 0.030 | 0.106 | 0.045 | 0.101 | 0.259 | 1.000 | 0.071 | 0.337 | 0.088 | 0.450 |
| n_outpatient | 0.016 | 0.020 | 0.015 | 0.000 | 0.020 | 0.024 | 0.016 | 0.037 | 0.070 | 0.176 | 0.168 | -0.031 | 0.071 | 1.000 | -0.030 | 0.059 | -0.018 |
| n_procedures | 0.042 | -0.099 | 0.019 | 0.021 | -0.218 | -0.100 | -0.044 | 0.054 | 0.075 | -0.051 | -0.068 | 0.015 | 0.337 | -0.030 | 1.000 | 0.047 | 0.177 |
| readmitted | 0.023 | 0.042 | 0.043 | 0.062 | 0.056 | 0.032 | 0.039 | 0.015 | 0.056 | 0.037 | 0.191 | 0.045 | 0.088 | 0.059 | 0.047 | 1.000 | 0.053 |
| time_in_hospital | 0.053 | 0.101 | 0.114 | 0.072 | 0.026 | 0.091 | 0.077 | 0.034 | 0.028 | -0.004 | 0.094 | 0.350 | 0.450 | -0.018 | 0.177 | 0.053 | 1.000 |
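
Only the change/diabetes_med pair (0.506) appears in the report's high-correlation alerts. A simple way to see why is to scan the off-diagonal entries against a cut-off. The sketch below assumes a threshold of 0.5, which is an assumption about the profiler's configuration and is not stated in this report. The dictionary holds a handful of entries copied from the table above, and the function name `high_correlation_pairs` is illustrative.

```python
# A few off-diagonal entries copied from the correlation table above.
correlations = {
    ("change", "diabetes_med"): 0.506,
    ("n_medications", "time_in_hospital"): 0.450,
    ("n_lab_procedures", "time_in_hospital"): 0.350,
    ("n_medications", "n_procedures"): 0.337,
    ("diag_1", "n_procedures"): -0.218,
}

def high_correlation_pairs(matrix, threshold=0.5):
    """Return (pair, value) items with |value| >= threshold, strongest first.

    The default threshold of 0.5 is an assumed cut-off, not a documented one.
    """
    flagged = [(pair, value) for pair, value in matrix.items()
               if abs(value) >= threshold]
    return sorted(flagged, key=lambda item: abs(item[1]), reverse=True)

for pair, value in high_correlation_pairs(correlations):
    print(pair, value)
```

Lowering the threshold to 0.3 would additionally surface the n_medications and n_lab_procedures relationships with time_in_hospital, which may be worth inspecting before modelling.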
First rows
| | age | time_in_hospital | n_lab_procedures | n_procedures | n_medications | n_outpatient | n_inpatient | n_emergency | medical_specialty | diag_1 | diag_2 | diag_3 | glucose_test | A1Ctest | change | diabetes_med | readmitted |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3 | 8 | 72 | 1 | 18 | 2 | 0 | 0 | 4 | 0 | 7 | 6 | 1 | 1 | 0 | 1 | 0 |
| 1 | 3 | 3 | 34 | 2 | 13 | 0 | 0 | 0 | 5 | 6 | 6 | 6 | 1 | 1 | 0 | 1 | 0 |
| 2 | 1 | 5 | 45 | 0 | 18 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 |
| 3 | 3 | 2 | 36 | 0 | 12 | 1 | 0 | 0 | 4 | 0 | 6 | 1 | 1 | 1 | 1 | 1 | 1 |
| 4 | 2 | 1 | 42 | 0 | 7 | 0 | 0 | 0 | 3 | 6 | 0 | 7 | 1 | 1 | 0 | 1 | 0 |
| 5 | 0 | 2 | 51 | 0 | 10 | 0 | 0 | 0 | 4 | 6 | 6 | 6 | 1 | 1 | 0 | 0 | 1 |
| 6 | 1 | 4 | 44 | 2 | 21 | 0 | 0 | 0 | 4 | 3 | 6 | 6 | 1 | 2 | 1 | 1 | 0 |
| 7 | 2 | 1 | 19 | 6 | 16 | 0 | 0 | 1 | 5 | 0 | 6 | 6 | 1 | 1 | 0 | 1 | 1 |
| 8 | 4 | 4 | 67 | 3 | 13 | 0 | 0 | 0 | 3 | 2 | 6 | 6 | 1 | 1 | 0 | 0 | 1 |
| 9 | 3 | 8 | 37 | 1 | 18 | 0 | 0 | 0 | 2 | 7 | 7 | 6 | 1 | 1 | 1 | 1 | 0 |
Last rows
| | age | time_in_hospital | n_lab_procedures | n_procedures | n_medications | n_outpatient | n_inpatient | n_emergency | medical_specialty | diag_1 | diag_2 | diag_3 | glucose_test | A1Ctest | change | diabetes_med | readmitted |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 24990 | 1 | 1 | 15 | 1 | 10 | 0 | 0 | 0 | 6 | 5 | 1 | 0 | 1 | 1 | 0 | 0 | 0 |
| 24991 | 2 | 4 | 45 | 4 | 21 | 0 | 0 | 0 | 4 | 6 | 6 | 3 | 1 | 2 | 1 | 1 | 1 |
| 24992 | 1 | 1 | 35 | 5 | 18 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 |
| 24993 | 2 | 2 | 9 | 1 | 14 | 0 | 0 | 0 | 4 | 7 | 0 | 0 | 1 | 1 | 0 | 1 | 0 |
| 24994 | 2 | 2 | 61 | 4 | 11 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 |
| 24995 | 4 | 14 | 77 | 1 | 30 | 0 | 0 | 0 | 4 | 0 | 6 | 0 | 1 | 2 | 0 | 0 | 1 |
| 24996 | 4 | 2 | 66 | 0 | 24 | 0 | 0 | 0 | 4 | 2 | 3 | 6 | 1 | 0 | 1 | 1 | 1 |
| 24997 | 3 | 5 | 12 | 0 | 6 | 0 | 1 | 0 | 4 | 6 | 6 | 6 | 2 | 1 | 0 | 0 | 1 |
| 24998 | 3 | 2 | 61 | 3 | 15 | 0 | 0 | 0 | 2 | 7 | 1 | 6 | 1 | 1 | 1 | 1 | 0 |
| 24999 | 1 | 10 | 37 | 1 | 24 | 0 | 0 | 0 | 4 | 6 | 1 | 0 | 1 | 1 | 0 | 0 | 1 |